In his article Götze [9] used Stein's method to provide an ingenious derivation of the Berry-Esseen type bound for
the class of Borel convex subsets of $\mathbb{R}^k$ in the context of the classical multivariate central limit theorem (CLT). This
approach has proved fruitful in deriving error bounds for the CLT under certain structures of dependence as well (see
Rinott and Rotar [11]). Our view and elaboration of Götze's proof resulted from a collaboration between the authors
and were first presented in a seminar at Stanford given by the first author in the summer of 2000. The authors wish
to thank Persi Diaconis for pointing out the need for a more readable account of Götze's result than given in his
original work.

After an explanation of the general method in Section 1, detailed derivations of various estimates are given in 
Sections 2-4 in terms that would be reasonably familiar to probabilists. Except for the smoothing inequality in Section 
4, which is fairly standard, complete proofs are given. 

Recently Raič [10] has followed essentially the same route as Götze, but in greater detail, in deriving Götze's bound.
It may be pointed out that we were unable to verify the dimensional dependence $O(k)$ in [9], [10]. Our derivation
provides the higher order dependence of the error rate on k, namely O(ki). This rate can be reduced to 0{ki) using 
an inequality of Ball [1]. The best order of dependence known, namely $O(k^{1/4})$, is given by Bentkus [3], using a different
method, which would be difficult to extend to dependent cases.

As a matter of notation, the constants $c$, with or without subscripts, are absolute constants. The $k$-dimensional
standard Normal distribution is denoted by $N(0, I_k)$ as well as $\Phi$, with density $\phi$.

1.1. The Generator of the ergodic Markov process as a Stein operator. 

Suppose $Q$ and $Q_0$ are two probability measures on a measurable space $(S, \mathcal{S})$ and $h$ is integrable (with respect to
$Q$ and $Q_0$). Consider the problem of estimating

$$ E_Q h - E_{Q_0} h. \tag{1.1} $$

A basic idea of Stein [12] (developed in some examples in [7] and [8]) is

(i) to find an invertible map $L$ which maps "nice" functions on $S$ into the kernel or null space of $E_{Q_0}$,

(ii) to find a perturbation of $L$, say $L_\alpha$, which maps "nice" functions on $S$ into the kernel or null space of $E_Q$,

(iii) to estimate (1.1) using the identity

$$ E_Q h - E_{Q_0} h = E_Q L g_0 = E_Q (L g_0 - L_\alpha g_\alpha), \tag{1.2} $$

where

$$ g_0 = L^{-1}(h - E_{Q_0} h), \qquad g_\alpha = L_\alpha^{-1}(h - E_Q h). $$
One way to find $L$ is to consider an ergodic Markov process $\{X_t : t \ge 0\}$ on $S$ which has $Q_0$ as its invariant
distribution, and let $L$ be its generator:

$$ L g = \lim_{t \downarrow 0} \frac{T_t g - g}{t}, \qquad g \in \mathcal{D}_L, \tag{1.3} $$

where the limit is in $L^2(S, Q_0)$, and

$$ (T_t g)(x) = E[g(X_t) \mid X_0 = x], $$

or, in terms of the transition probability $p(t; x, dy)$ of the Markov process $\{X_t : t \ge 0\}$,

$$ (T_t g)(x) = \int_S g(y)\, p(t; x, dy) \qquad (x \in S,\ t > 0). \tag{1.4} $$



Also $\mathcal{D}_L$ is the set of $g$ for which the limit in (1.3) exists. By the Markov (or, semigroup) property, $T_{t+s} = T_t T_s = T_s T_t$,
so that

$$ \frac{d}{dt} T_t g = \lim_{s \downarrow 0} \frac{T_{t+s} g - T_t g}{s} = \lim_{s \downarrow 0} \frac{T_t (T_s g - g)}{s} = T_t L g. \tag{1.5} $$

Since $T_t T_s = T_s T_t$, $T_t$ and $L$ commute, so that

$$ \frac{d}{dt} T_t g = L T_t g. \tag{1.6} $$

Note that invariance of $Q_0$ means $E T_t g(X_0) = E g(X_0) = \int g\, dQ_0$, if the distribution of $X_0$ is $Q_0$. This implies that,
for every $g \in \mathcal{D}_L$, $E L g(X_0) = 0$, or

$$ \int_S L g(x)\, dQ_0(x) = 0, $$

since

$$ E L g(X_0) = E\Big( \lim_{t \downarrow 0} \frac{T_t g(X_0) - g(X_0)}{t} \Big) = \lim_{t \downarrow 0} \frac{E T_t g(X_0) - E g(X_0)}{t} = 0. $$

That is, $L$ maps $\mathcal{D}_L$ into the set $1^\perp$ of mean zero functions in $L^2(S, Q_0)$. It is known that the range of $L$ is dense in
$1^\perp$ and if $L$ has a spectral gap, then the range of $L$ is all of $1^\perp$. In the latter case $L^{-1}$ is well defined on $1^\perp$ (kernel
of $Q_0$) and is bounded on it ([4]).

Since $T_t$ converges to the identity operator as $t \downarrow 0$, one may also use $T_t$ for small $t > 0$ to smooth the target
function $\bar h = h - \int h\, dQ_0$. For the case of a diffusion $\{X_t : t \ge 0\}$, $L$ is a differential operator and even nonsmooth
functions such as $\bar h = 1_B - Q_0(B)$ ($h = 1_B$) are immediately made smooth by applying $T_t$. One may then use the
approximation to $\bar h$ given by

$$ T_t \bar h = L(L^{-1} T_t \bar h) = L \psi_t, \qquad \text{with } \psi_t = L^{-1} T_t \bar h, \tag{1.7} $$

and then estimate the error of this approximation by a "smoothing inequality", especially if $T_t \bar h$ may be represented
as a perturbation by convolution. For several perspectives and applications of Stein's method see [2], [7], [8], [11].



1(b) The Ornstein-Uhlenbeck Process and its Gaussian invariant Distribution

The Ornstein-Uhlenbeck (O-U) process is governed by the Langevin equation (see, e.g., [6, pp. 476, 597, 598])

$$ dX_t = -X_t\, dt + \sqrt{2}\, dB_t, \tag{1.8} $$
where $\{B_t : t \ge 0\}$ is a $k$-dimensional standard Brownian motion. Its transition density is

$$ p(t; x, y) = \prod_{i=1}^{k} \big[2\pi(1 - e^{-2t})\big]^{-1/2} \exp\Big\{ -\frac{(y_i - e^{-t} x_i)^2}{2(1 - e^{-2t})} \Big\}, \qquad x = (x_1, \ldots, x_k),\ y = (y_1, \ldots, y_k). \tag{1.9} $$

This is the density of a Gaussian (Normal) distribution with mean vector $e^{-t} x$ and dispersion matrix $(1 - e^{-2t}) I_k$,
where $I_k$ is the $k \times k$ identity matrix. One can check (e.g., by direct differentiation) that the Kolmogorov backward
equation holds:

$$ \frac{\partial p(t; x, y)}{\partial t} = \sum_{i=1}^{k} \frac{\partial^2 p(t; x, y)}{\partial x_i^2} - \sum_{i=1}^{k} x_i \frac{\partial p(t; x, y)}{\partial x_i} = \Delta p - x \cdot \nabla p = L p, \qquad \text{with } L = \Delta - x \cdot \nabla, \tag{1.10} $$

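where $\Delta$ is the Laplacian and $\nabla = \mathrm{grad}$. Integrating both sides w.r.t. $h(y)\,dy$ we see that $T_t h(x) = \int h(y)\, p(t; x, y)\, dy$
satisfies

$$ \frac{\partial}{\partial t} T_t h(x) = \Delta T_t h(x) - x \cdot \nabla T_t h(x) = L T_t h(x), \qquad \forall h \in L^2(\mathbb{R}^k, \Phi). \tag{1.11} $$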

Now on the space $L^2(\mathbb{R}^k, \Phi)$ (where $\Phi = N(0, I_k)$ is the $k$-dimensional standard Normal), $L$ is self-adjoint and
has a spectral gap, with the eigenvalue 0 corresponding to the invariant distribution $\Phi$ (or the constant function 1
on $L^2(\mathbb{R}^k, \Phi)$). This may be deduced from the fact that the Normal density $p(t; x, y)$ (with mean vector $e^{-t} x$ and
dispersion matrix $(1 - e^{-2t}) I_k$) converges to the standard Normal density $\phi(y)$ exponentially fast as $t \to \infty$, for
every initial state $x$. Alternatively, one can compute the set of eigenvalues of $L$, namely $\{0, -1, -2, \ldots\}$, with eigenfunctions
expressed in terms of Hermite polynomials [6, page 487]. In particular, $L^{-1}$ is a bounded operator on $1^\perp$ and is given
by

$$ L^{-1} \bar h = -\int_0^\infty T_s \bar h(x)\, ds, \qquad \forall\, \bar h = h - \int h\, d\Phi \in L^2(\mathbb{R}^k, \Phi). \tag{1.12} $$
To check this, note that by (1.11)

$$ \bar h = -\int_0^\infty \frac{\partial}{\partial s} T_s \bar h(x)\, ds = -\int_0^\infty L T_s \bar h(x)\, ds = L\Big( -\int_0^\infty T_s \bar h(x)\, ds \Big). \tag{1.13} $$

For our purposes $h = 1_C$, the indicator function of a Borel convex subset $C$ of $\mathbb{R}^k$.

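A smooth approximation of $\bar h$ is $T_t \bar h$ for small $t > 0$ (since $T_t \bar h$ is infinitely differentiable). Also, by (1.12),

$$ \psi_t(x) \equiv L^{-1} T_t \bar h(x) = -\int_0^\infty T_s T_t \bar h(x)\, ds = -\int_0^\infty T_{s+t} \bar h(x)\, ds = -\int_t^\infty T_s \bar h(x)\, ds = -\int_t^\infty \Big\{ \int_{\mathbb{R}^k} \bar h\big(e^{-s} x + \sqrt{1 - e^{-2s}}\, z\big) \phi(z)\, dz \Big\} ds, \tag{1.14} $$

where $\phi$ is the $k$-dimensional standard Normal density. We have expressed $T_s \bar h(x) = E[\bar h(X_s) \mid X_0 = x]$ in (1.14) as

$$ E[\bar h(X_s) \mid X_0 = x] = E \bar h\big(e^{-s} x + \sqrt{1 - e^{-2s}}\, Z\big), \tag{1.15} $$

where $Z$ is a standard Normal $N(0, I_k)$, since, given $X_0 = x$, $X_s$ has the same distribution as $e^{-s} x + \sqrt{1 - e^{-2s}}\, Z$. Now note that,
using (1.14), one may write

$$ T_t \bar h(x) = L(L^{-1} T_t \bar h(x)) = \Delta(L^{-1} T_t \bar h(x)) - x \cdot \nabla(L^{-1} T_t \bar h(x)) = \Delta \psi_t(x) - x \cdot \nabla \psi_t(x). \tag{1.16} $$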
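The representation (1.15) and the eigenvalue statement for $L$ can be checked numerically in one dimension. The sketch below is only an illustration under stated assumptions: the helper names (`ou_semigroup`, `he2`, `he3`) and the plain trapezoidal quadrature are our own choices, not part of the original argument. It verifies that the Hermite polynomials $x^2 - 1$ and $x^3 - 3x$ satisfy $T_s g = e^{-2s} g$ and $T_s g = e^{-3s} g$ respectively, which is what eigenvalues $-2$ and $-3$ of $L$ predict.

```python
import math


def std_normal_pdf(z):
    """Density of the one-dimensional standard Normal distribution."""
    return math.exp(-0.5 * z * z) / math.sqrt(2.0 * math.pi)


def ou_semigroup(h, s, x, half_width=10.0, steps=20000):
    """Approximate T_s h(x) = E h(e^{-s} x + sqrt(1 - e^{-2s}) Z), k = 1,
    by the trapezoidal rule on [-half_width, half_width] (hypothetical helper)."""
    a = math.exp(-s)
    sigma = math.sqrt(1.0 - math.exp(-2.0 * s))
    dz = 2.0 * half_width / steps
    total = 0.0
    for i in range(steps + 1):
        z = -half_width + i * dz
        weight = 0.5 if i in (0, steps) else 1.0
        total += weight * h(a * x + sigma * z) * std_normal_pdf(z)
    return total * dz


def he2(y):
    """Hermite polynomial of degree 2 (eigenvalue -2 of L)."""
    return y * y - 1.0


def he3(y):
    """Hermite polynomial of degree 3 (eigenvalue -3 of L)."""
    return y ** 3 - 3.0 * y
```

For instance, `ou_semigroup(he2, 0.7, 1.3)` should agree with `math.exp(-1.4) * he2(1.3)` up to quadrature error.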
For the problem at hand (see (1.1)), $Q_0 = \Phi$ and $Q = Q_{(n)}$ is the distribution of $S_n = \frac{1}{\sqrt{n}}(Y_1 + Y_2 + \cdots + Y_n) =
X_1 + X_2 + \cdots + X_n$ ($X_j = Y_j/\sqrt{n}$), where the $Y_j$'s are i.i.d. mean-zero with covariance matrix $I_k$ and finite absolute
third moment

$$ \rho_3 = E\|Y_1\|^3 = E\Big( \sum_{i=1}^{k} (Y_1^{(i)})^2 \Big)^{3/2}. $$

We want to estimate

$$ E \bar h(S_n) = E h(S_n) - \int h\, d\Phi \tag{1.17} $$

for $h = 1_C$, $C \in \mathcal{C}$, the class of all Borel convex sets in $\mathbb{R}^k$.
For this we first estimate (see (1.16)), for small $t > 0$,

$$ E T_t \bar h(S_n) = E\big[ \Delta \psi_t(S_n) - S_n \cdot \nabla \psi_t(S_n) \big]. \tag{1.18} $$

This is done in Section 3. The next step is to estimate, for small $t > 0$,

$$ E T_t \bar h(S_n) - E \bar h(S_n), \tag{1.19} $$

which is carried out in Section 4. Combining the estimates of (1.18) and (1.19), and with a suitable choice of $t > 0$,
one arrives at the desired estimation of (1.17).
We will write

$$ \delta_n = \sup_{\{h = 1_C :\, C \in \mathcal{C}\}} \Big| \int h\, dQ_{(n)} - \int h\, d\Phi \Big|. \tag{1.20} $$



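2. Derivatives of $\psi_t = L^{-1} T_t \bar h$

Before we engage in the estimation of (1.18) and (1.19), it is useful to compute certain derivatives of $\psi_t$.

Let

$$ D_i = \frac{\partial}{\partial x_i}, \qquad D_{ii'} = \frac{\partial^2}{\partial x_i\, \partial x_{i'}}, \qquad D_{ii'i''} = \frac{\partial^3}{\partial x_i\, \partial x_{i'}\, \partial x_{i''}}, \quad \text{etc.} $$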



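Then, using (1.14),

$$ D_i \psi_t(x) = -\int_t^\infty \Big\{ \int_{\mathbb{R}^k} \bar h(y)\, \big(2\pi(1 - e^{-2s})\big)^{-k/2}\, \frac{e^{-s}(y_i - e^{-s} x_i)}{1 - e^{-2s}}\, \exp\Big\{ -\frac{\|y - e^{-s} x\|^2}{2(1 - e^{-2s})} \Big\}\, dy \Big\} ds $$

$$ = -\int_t^\infty \Big( \frac{e^{-s}}{\sqrt{1 - e^{-2s}}} \Big) \Big\{ \int_{\mathbb{R}^k} \bar h\big(e^{-s} x + \sqrt{1 - e^{-2s}}\, z\big)\, z_i\, \phi(z)\, dz \Big\} ds, \qquad \Big[ z_i \phi(z) = -\frac{\partial}{\partial z_i} \phi(z) = -D_i \phi(z) \Big], \tag{2.1} $$

using the change of variables

$$ z = \frac{y - e^{-s} x}{\sqrt{1 - e^{-2s}}}. $$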
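In the same manner, one has, using $D_i$, $D_{ii'}$, etc. for the derivatives $\frac{\partial}{\partial z_i}$, $\frac{\partial^2}{\partial z_i\, \partial z_{i'}}$, etc.,

$$ D_{ii'} \psi_t(x) = -\int_t^\infty \Big( \frac{e^{-s}}{\sqrt{1 - e^{-2s}}} \Big)^2 \Big\{ \int_{\mathbb{R}^k} \bar h\big(e^{-s} x + \sqrt{1 - e^{-2s}}\, z\big) \cdot D_{ii'} \phi(z)\, dz \Big\} ds, $$

$$ D_{ii'i''} \psi_t(x) = -\int_t^\infty \Big( \frac{e^{-s}}{\sqrt{1 - e^{-2s}}} \Big)^3 \Big\{ \int_{\mathbb{R}^k} \bar h\big(e^{-s} x + \sqrt{1 - e^{-2s}}\, z\big) \cdot \big( -D_{ii'i''} \phi(z) \big)\, dz \Big\} ds. \tag{2.2} $$

The following estimate is used in the next section: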



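$$ \sup_{u \in \mathbb{R}^k} \Big| \int_{\mathbb{R}^k} \int_{\mathbb{R}^k} \bar h\Big( \sqrt{\tfrac{n-1}{n}}\, e^{-s} x + e^{-s} u + \sqrt{1 - e^{-2s}}\, z \Big)\, \phi(x)\, D_{ii'i''} \phi(z)\, dx\, dz \Big| \le c\, k\, e^{2s}(1 - e^{-2s}). \tag{2.3} $$

To prove this, write $a = \sqrt{\frac{n}{n-1}}\, e^{s} \sqrt{1 - e^{-2s}}$ and change variables $x \to y = x + a z$. Then

$$ \phi(x) = \phi(y - a z) = \phi(y) - a z \cdot \nabla \phi(y) + a^2 \sum_{r, r' = 1}^{k} z_r z_{r'} \int_0^1 (1 - v)\, D_{rr'} \phi(y - v a z)\, dv, \tag{2.4} $$

so that

$$ \bar h\Big( \sqrt{\tfrac{n-1}{n}}\, e^{-s} x + e^{-s} u + \sqrt{1 - e^{-2s}}\, z \Big) = \bar h\Big( \sqrt{\tfrac{n-1}{n}}\, e^{-s} y + e^{-s} u \Big), $$

and the double integral in (2.3) becomes

$$ \int_{\mathbb{R}^k} \int_{\mathbb{R}^k} \bar h\Big( \sqrt{\tfrac{n-1}{n}}\, e^{-s} y + e^{-s} u \Big) \Big[ \phi(y) - a z \cdot \nabla \phi(y) + a^2 \sum_{r, r' = 1}^{k} z_r z_{r'} \int_0^1 (1 - v)\, D_{rr'} \phi(y - v a z)\, dv \Big] D_{ii'i''} \phi(z)\, dz\, dy. \tag{2.5} $$

Note that the integrals of $D_{ii'i''} \phi(z)$ and $z_{i_0} D_{ii'i''} \phi(z)$ vanish for all $i, i', i'', i_0$, so that

$$ \int_{\mathbb{R}^k} \int_{\mathbb{R}^k} \bar h\Big( \sqrt{\tfrac{n-1}{n}}\, e^{-s} y + e^{-s} u \Big) \big( \phi(y) - a z \cdot \nabla \phi(y) \big)\, D_{ii'i''} \phi(z)\, dz\, dy = 0. \tag{2.6} $$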


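The magnitude of the last term on the right in (2.4) is

$$ a^2 \Big| \int_0^1 (1 - v) \Big[ \sum_{r, r' = 1}^{k} z_r z_{r'} (y - v a z)_r (y - v a z)_{r'} - \sum_{r = 1}^{k} z_r^2 \Big] \phi(y - a v z)\, dv \Big| \le a^2 \int_0^1 (1 - v) \Big[ \sum_{r, r' = 1}^{k} z_r z_{r'} (y - v a z)_r (y - v a z)_{r'} + \sum_{r = 1}^{k} z_r^2 \Big] \phi(y - a v z)\, dv, \tag{2.7} $$

since the sum $\sum_{r, r'}$ above is nonnegative. Bounding $|\bar h|$ by 1, it follows from (2.5)-(2.7) that the left side of (2.3) is
no more than

$$ a^2 \int_0^1 (1 - v) \Big\{ \int_{\mathbb{R}^k} \Big[ \sum_{r \ne r'} z_r z_{r'} \int_{\mathbb{R}^k} (y - v a z)_r (y - v a z)_{r'}\, \phi(y - v a z)\, dy + \sum_{r = 1}^{k} z_r^2 \int_{\mathbb{R}^k} \big\{ (y - v a z)_r^2 + 1 \big\}\, \phi(y - a v z)\, dy \Big] |D_{ii'i''} \phi(z)|\, dz \Big\} dv $$

$$ = a^2 \int_0^1 (1 - v) \Big\{ \int_{\mathbb{R}^k} 2 \sum_{r = 1}^{k} z_r^2\, |D_{ii'i''} \phi(z)|\, dz \Big\} dv, \tag{2.8} $$

from which (2.3) follows.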
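3. Estimation of $T_t \bar h(S_n)$

By (1.16),

$$ T_t \bar h(S_n) = L(L^{-1} T_t \bar h)(S_n) = L \psi_t(S_n) = \Delta \psi_t(S_n) - S_n \cdot \nabla \psi_t(S_n). \tag{3.1} $$

Consider the Taylor expansions

$$ \Delta \psi_t(S_n) = \sum_{i=1}^{k} D_{ii} \psi_t(S_n) = \sum_{i=1}^{k} D_{ii} \psi_t(S_n - X_j) + \sum_{i, i' = 1}^{k} X_j^{(i')} \int_0^1 D_{iii'} \psi_t(S_n - X_j + v X_j)\, dv, $$

$$ S_n \cdot \nabla \psi_t(S_n) = \sum_{j=1}^{n} X_j \cdot \nabla \psi_t(S_n) = \sum_{j=1}^{n} \sum_{i=1}^{k} X_j^{(i)} D_i \psi_t(S_n) $$

$$ = \sum_{j=1}^{n} \Big[ X_j \cdot \nabla \psi_t(S_n - X_j) + \sum_{i, i' = 1}^{k} X_j^{(i)} X_j^{(i')} D_{ii'} \psi_t(S_n - X_j) + \sum_{i, i', i'' = 1}^{k} X_j^{(i)} X_j^{(i')} X_j^{(i'')} \int_0^1 (1 - v)\, D_{ii'i''} \psi_t(S_n - X_j + v X_j)\, dv \Big]. \tag{3.2} $$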



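Recalling that $X_j = Y_j/\sqrt{n}$, $E Y_j = 0$, $E X_j^{(i)} X_j^{(i')} = \frac{1}{n} E Y_j^{(i)} Y_j^{(i')} = \frac{1}{n} \delta_{ii'}$, and that $X_j$ and $S_n - X_j$ are independent,

$$ E \Delta \psi_t(S_n) = E\Big[ \sum_{i=1}^{k} D_{ii} \psi_t(S_n - X_1) + \frac{1}{\sqrt{n}} \sum_{i, i' = 1}^{k} Y_1^{(i')} \int_0^1 D_{iii'} \psi_t(S_n - X_1 + v X_1)\, dv \Big], \tag{3.3} $$

$$ E\, S_n \cdot \nabla \psi_t(S_n) = E\Big[ \sum_{i=1}^{k} D_{ii} \psi_t(S_n - X_1) + \frac{1}{\sqrt{n}} \sum_{i, i', i'' = 1}^{k} Y_1^{(i)} Y_1^{(i')} Y_1^{(i'')} \int_0^1 (1 - v)\, D_{ii'i''} \psi_t(S_n - X_1 + v X_1)\, dv \Big]. \tag{3.4} $$

Hence

$$ E T_t \bar h(S_n) = E\Big[ \frac{1}{\sqrt{n}} \sum_{i, i' = 1}^{k} Y_1^{(i')} \int_0^1 D_{iii'} \psi_t(S_n - X_1 + v X_1)\, dv - \frac{1}{\sqrt{n}} \sum_{i, i', i'' = 1}^{k} Y_1^{(i)} Y_1^{(i')} Y_1^{(i'')} \int_0^1 (1 - v)\, D_{ii'i''} \psi_t(S_n - X_1 + v X_1)\, dv \Big]. \tag{3.5} $$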

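One may then write

$$ E T_t \bar h(S_n) = E\big[ E(\cdots \mid Y_1) \big], \tag{3.6} $$

where $\cdots$ is the quantity within square brackets in (3.5), i.e.,

$$ E[\cdots \mid Y_1] = \frac{1}{\sqrt{n}} \sum_{i, i' = 1}^{k} Y_1^{(i')} \int_0^1 E\big[ D_{iii'} \psi_t(S_n - X_1 + v X_1) \mid Y_1 \big]\, dv - \frac{1}{\sqrt{n}} \sum_{i, i', i'' = 1}^{k} Y_1^{(i)} Y_1^{(i')} Y_1^{(i'')} \int_0^1 (1 - v)\, E\big[ D_{ii'i''} \psi_t(S_n - X_1 + v X_1) \mid Y_1 \big]\, dv. \tag{3.7} $$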
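The first term on the right side in (3.7) equals

$$ -\frac{1}{\sqrt{n}} \sum_{i, i' = 1}^{k} Y_1^{(i')} \int_t^\infty \Big( \frac{e^{-s}}{\sqrt{1 - e^{-2s}}} \Big)^3 \Big\{ \int_0^1 E\Big[ \int_{\mathbb{R}^k} \bar h\big( e^{-s}(S_n - X_1) + e^{-s} v X_1 + \sqrt{1 - e^{-2s}}\, z \big) \cdot \big( -D_{iii'} \phi(z) \big)\, dz \,\Big|\, Y_1 \Big] dv \Big\} ds $$

$$ = \frac{1}{\sqrt{n}} \sum_{i, i' = 1}^{k} Y_1^{(i')} \int_t^\infty \Big( \frac{e^{-s}}{\sqrt{1 - e^{-2s}}} \Big)^3 \Big\{ \int_0^1 \Big[ \int_{\mathbb{R}^k} \Big\{ \int_{\mathbb{R}^k} \bar h\Big( \sqrt{\tfrac{n-1}{n}}\, e^{-s} x + e^{-s} v X_1 + \sqrt{1 - e^{-2s}}\, z \Big)\, dQ_{(n-1)}(x) \Big\} D_{iii'} \phi(z)\, dz \Big] dv \Big\} ds, \tag{3.8} $$

noting that the distribution of $S_n - X_1 = \sqrt{\frac{n-1}{n}} \cdot \frac{Y_2 + \cdots + Y_n}{\sqrt{n-1}}$ is that of $\sqrt{\frac{n-1}{n}}\, V$, where $V$ has distribution $Q_{(n-1)}$.

Therefore, (3.8) is equal to

$$ \frac{1}{\sqrt{n}} \sum_{i, i' = 1}^{k} Y_1^{(i')} \int_t^\infty \Big( \frac{e^{-s}}{\sqrt{1 - e^{-2s}}} \Big)^3 \Big\{ \int_0^1 \Big[ \int_{\mathbb{R}^k} \Big\{ \int_{\mathbb{R}^k} \bar h\Big( \sqrt{\tfrac{n-1}{n}}\, e^{-s} x + e^{-s} v X_1 + \sqrt{1 - e^{-2s}}\, z \Big) \big( d(Q_{(n-1)}(x) - \Phi(x)) + d\Phi(x) \big) \Big\} D_{iii'} \phi(z)\, dz \Big] dv \Big\} ds. \tag{3.9} $$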



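Since the class of functions $h = 1_C$, where $C$ ranges over all Borel convex subsets of $\mathbb{R}^k$, is invariant under
translation, and $bC$ is convex if $C$ is convex ($bC = \{bx : x \in C\}$, $\forall b > 0$),

$$ \Big| \int_{\mathbb{R}^k} \bar h\Big( \sqrt{\tfrac{n-1}{n}}\, e^{-s} x + e^{-s} v X_1 + \sqrt{1 - e^{-2s}}\, z \Big)\, d\big( Q_{(n-1)}(x) - \Phi(x) \big) \Big| \le \delta_{n-1}. \tag{3.10} $$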



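Similarly, the second term on the right in (3.7) equals

$$ -\frac{1}{\sqrt{n}} \sum_{i, i', i'' = 1}^{k} Y_1^{(i)} Y_1^{(i')} Y_1^{(i'')} \int_t^\infty \Big( \frac{e^{-s}}{\sqrt{1 - e^{-2s}}} \Big)^3 \Big\{ \int_0^1 (1 - v) \Big[ \int_{\mathbb{R}^k} \Big\{ \int_{\mathbb{R}^k} \bar h\Big( \sqrt{\tfrac{n-1}{n}}\, e^{-s} x + e^{-s} v X_1 + \sqrt{1 - e^{-2s}}\, z \Big) \big( d(Q_{(n-1)}(x) - \Phi(x)) + d\Phi(x) \big) \Big\} D_{ii'i''} \phi(z)\, dz \Big] dv \Big\} ds. \tag{3.11} $$

Again, the inner integral in (3.11) with respect to $Q_{(n-1)} - \Phi$ is estimated by (3.10). Therefore, using (2.3) for the
remaining integration with respect to $\Phi$ in (3.8), (3.11),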

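$$ |E T_t \bar h(S_n)| \le \frac{1}{\sqrt{n}} \sum_{i, i' = 1}^{k} E|Y_1^{(i')}| \int_t^\infty \Big( \frac{e^{-s}}{\sqrt{1 - e^{-2s}}} \Big)^3 \Big[ \delta_{n-1} \int_{\mathbb{R}^k} |D_{iii'} \phi(z)|\, dz + c\, k\, e^{2s}(1 - e^{-2s}) \Big] ds $$

$$ + \frac{1}{\sqrt{n}} \sum_{i, i', i'' = 1}^{k} E\big| Y_1^{(i)} Y_1^{(i')} Y_1^{(i'')} \big| \int_t^\infty \Big( \frac{e^{-s}}{\sqrt{1 - e^{-2s}}} \Big)^3 \Big[ \delta_{n-1} \Big( \int_0^1 (1 - v)\, dv \Big) \int_{\mathbb{R}^k} |D_{ii'i''} \phi(z)|\, dz + c\, k\, e^{2s}(1 - e^{-2s}) \Big] ds. \tag{3.12} $$

Next, the first two terms on the right in (3.12) may be estimated by using

$$ \int_{\mathbb{R}^k} |D_{ii'i''} \phi(z)|\, dz = \begin{cases} E\big| Z^{(i)} Z^{(i')} Z^{(i'')} \big| \le 1 & \text{if } i, i', i'' \text{ are all distinct}, \\ E\big| (Z^{(i)})^2 - 1 \big| \cdot E\big| Z^{(i')} \big| \le 1 & \text{if exactly two of the indices coincide (say } i'' = i \ne i'), \\ E\big| (Z^{(i)})^3 - 3 Z^{(i)} \big| \le \sqrt{6} & \text{if } i = i' = i''. \end{cases} \tag{3.13} $$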
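The three one-dimensional Gaussian moments entering (3.13) can be checked numerically. The following is a minimal sketch, assuming a plain trapezoidal rule on $[-12, 12]$; the function name `gaussian_abs_moment` is our own illustrative choice. The bound $\sqrt{6}$ in the last case is the Cauchy-Schwarz bound, since $E(Z^3 - 3Z)^2 = 6$.

```python
import math


def gaussian_abs_moment(f, half_width=12.0, steps=240000):
    """Approximate E|f(Z)| for Z ~ N(0, 1) by the trapezoidal rule
    (hypothetical helper, used only for this illustration)."""
    dz = 2.0 * half_width / steps
    total = 0.0
    for i in range(steps + 1):
        z = -half_width + i * dz
        weight = 0.5 if i in (0, steps) else 1.0
        total += weight * abs(f(z)) * math.exp(-0.5 * z * z)
    return total * dz / math.sqrt(2.0 * math.pi)


abs_z = gaussian_abs_moment(lambda z: z)                   # E|Z|
abs_he2 = gaussian_abs_moment(lambda z: z * z - 1.0)       # E|Z^2 - 1|
abs_he3 = gaussian_abs_moment(lambda z: z ** 3 - 3.0 * z)  # E|Z^3 - 3Z|

all_distinct_case = abs_z ** 3       # i, i', i'' all distinct
two_equal_case = abs_he2 * abs_z     # exactly two indices coincide
all_equal_case = abs_he3             # i = i' = i''
```

Here `abs_z` should be close to $\sqrt{2/\pi}$, and the three case values stay below the bounds $1$, $1$ and $\sqrt{6}$ respectively.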
Finally, note that
so that

$$ \int_0^\infty \frac{e^{-s}}{\sqrt{1 - e^{-2s}}}\, ds = c' < \infty, \qquad \int_t^\infty \Big( \frac{e^{-s}}{\sqrt{1 - e^{-2s}}} \Big)^3 ds \le (2t)^{-1/2}. \tag{3.15} $$
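The second bound in (3.15) can be sanity-checked numerically. The sketch below rests on our own computation, not on the original text: substituting $u = e^{-s}$ turns the integral into $\int_0^{e^{-t}} u^2 (1 - u^2)^{-3/2}\, du$, whose antiderivative is $u/\sqrt{1 - u^2} - \arcsin u$. The function names are hypothetical.

```python
import math


def cubed_kernel_integral(t):
    """Closed form of the integral over [t, infinity) of
    (e^{-s} / sqrt(1 - e^{-2s}))^3 ds, via u = e^{-s} (our derivation)."""
    u = math.exp(-t)
    return u / math.sqrt(1.0 - u * u) - math.asin(u)


def cubed_kernel_integral_numeric(t, steps=200000):
    """Trapezoidal approximation of the same integral in the variable u."""
    upper = math.exp(-t)
    du = upper / steps

    def integrand(u):
        return u * u / (1.0 - u * u) ** 1.5

    total = 0.5 * (integrand(0.0) + integrand(upper))
    for i in range(1, steps):
        total += integrand(i * du)
    return total * du


def upper_bound(t):
    """Right-hand side (2t)^{-1/2} of the second estimate in (3.15)."""
    return (2.0 * t) ** -0.5
```

Since $e^{-2t}/(1 - e^{-2t}) = 1/(e^{2t} - 1) \le 1/(2t)$, the closed form is at most $(2t)^{-1/2}$ for every $t > 0$, in agreement with (3.15).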



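Hence, using (3.12)-(3.15), together with the estimates

$$ \sum_{i=1}^{k} E|Y_1^{(i)}| \le k, \qquad \sum_{i, i', i'' = 1}^{k} E\big| Y_1^{(i)} Y_1^{(i')} Y_1^{(i'')} \big| \le k^{3/2} \rho_3, $$

one has

$$ |E T_t \bar h(S_n)| \le c_1 k^{3/2} \rho_3 \Big( \frac{\delta_{n-1}}{\sqrt{nt}} \Big) + \frac{c_2 k^{3/2} \rho_3}{\sqrt{n}}. \tag{3.16} $$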

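4. The smoothing inequality and the Estimation of $\delta_n$

Let $\mathcal{H} = \{1_C : C \in \mathcal{C}\}$, where $\mathcal{C}$ is the class of all Borel convex subsets of $\mathbb{R}^k$. As before, $\bar h = h - \int h\, d\Phi$. We also write
$G_b$ for the distribution of $bW$, if $W$ has distribution $G$ ($b > 0$). Recall that (see (1.15)) $T_t \bar h(x) = E \bar h(e^{-t} x + \sqrt{1 - e^{-2t}}\, Z)$,
where $Z$ has the standard Normal distribution $\Phi = N(0, I_k)$, which we take to be independent of $S_n$. Then

$$ E T_t \bar h(S_n) = E \bar h\big( e^{-t} S_n + \sqrt{1 - e^{-2t}}\, Z \big) = \int \int \bar h\big( e^{-t} x + \sqrt{1 - e^{-2t}}\, z \big)\, dQ_{(n)}(x)\, \phi(z)\, dz $$

$$ = \int \bar h\, d\big( (Q_{(n)})_{e^{-t}} * \Phi_{\sqrt{1 - e^{-2t}}} \big) = \int \bar h\, d\big[ \big( (Q_{(n)})_{e^{-t}} - \Phi_{e^{-t}} \big) * \Phi_{\sqrt{1 - e^{-2t}}} \big]. \tag{4.1} $$

The introduction of the extra term $\Phi_{e^{-t}} * \Phi_{\sqrt{1 - e^{-2t}}} = \Phi$ does not affect the integration in the last step since
$\int_{\mathbb{R}^k} \bar h\, d\Phi = 0$.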

Since the last integration is with respect to the difference between two probability measures, its value is unchanged
if we replace $\bar h$ by $h$. Hence

$$ E T_t \bar h(S_n) = \int h\, d\big[ \big( (Q_{(n)})_{e^{-t}} - \Phi_{e^{-t}} \big) * \Phi_{\sqrt{1 - e^{-2t}}} \big]. \tag{4.2} $$

Also the class $\mathcal{C}$ is invariant under multiplication $C \to bC$, where $b > 0$ is given. Therefore,

$$ \delta_n = \sup_{h \in \mathcal{H}} |E \bar h(S_n)| = \sup_{h \in \mathcal{H}} \Big| \int h\, d(Q_{(n)} - \Phi) \Big| = \sup_{h \in \mathcal{H}} \Big| \int h\, d\big[ (Q_{(n)})_{e^{-t}} - \Phi_{e^{-t}} \big] \Big|. \tag{4.3} $$

Thus (4.2) is a perturbation (or, smoothing) of the integral in (4.3) by convolution with $\Phi_{\sqrt{1 - e^{-2t}}}$. If $\epsilon > 0$ is a
constant such that

$$ \Phi_{\sqrt{1 - e^{-2t}}}\big( \{ |z| < \epsilon \} \big) = \frac{7}{8}, \tag{4.4} $$

then the smoothing inequality below applies, with $\mu = (Q_{(n)})_{e^{-t}}$, $\nu = \Phi_{e^{-t}}$, $K = \Phi_{\sqrt{1 - e^{-2t}}}$, $f = h = 1_C$, $\alpha = 7/8$,
and $\epsilon$ as in (4.4).
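Smoothing Inequality

Let $\mu$, $\nu$, $K$ be probability measures on $\mathbb{R}^k$, with $K(\{x : |x| < \epsilon\}) = \alpha > \frac{1}{2}$. Then for every bounded measurable $f$ one has

$$ \Big| \int f\, d(\mu - \nu) \Big| \le (2\alpha - 1)^{-1} \big[ \gamma^*(f : \epsilon) + \omega_f^*(2\epsilon : \nu) \big], \tag{4.5} $$

where

$$ f_\epsilon^+(x) = \sup\{ f(y) : |y - x| < \epsilon \}, \qquad f_\epsilon^-(x) = \inf\{ f(y) : |y - x| < \epsilon \}, $$

$$ \gamma(f : \epsilon) = \max\Big\{ \Big| \int f_\epsilon^+\, d(\mu - \nu) * K \Big|,\ \Big| \int f_\epsilon^-\, d(\mu - \nu) * K \Big| \Big\}, $$

$$ \gamma^*(f : \epsilon) = \sup_{y \in \mathbb{R}^k} \gamma(f_y : \epsilon), \qquad \big( f_y(x) = f(x + y) \big), $$

$$ \omega_f(x : \epsilon) = \sup\{ |f(y) - f(x)| : |y - x| < \epsilon \}, \qquad \bar\omega_f(\epsilon : \nu) = \int \omega_f(x : \epsilon)\, d\nu(x), $$

$$ \omega_f^*(\epsilon : \nu) = \sup_{y \in \mathbb{R}^k} \bar\omega_{f_y}(\epsilon : \nu). $$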

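For a proof of the inequality (4.5) see Bhattacharya and Rao [5], Lemma 11.4. With $h = 1_C$ one gets $h_\epsilon^+ = 1_{C^\epsilon}$,
$h_\epsilon^- = 1_{C^{-\epsilon}}$, where $C^\epsilon = \{x : \mathrm{dist}(x, C) < \epsilon\}$ and $C^{-\epsilon} = \{x : \text{the open ball of radius } \epsilon \text{ and center } x \text{ is contained in } C\}$ are
both convex, so that

$$ \gamma(h : \epsilon) \le \max\Big\{ \Big| \int 1_{C^\epsilon}\, d\big[ (Q_{(n)})_{e^{-t}} - \Phi_{e^{-t}} \big] * \Phi_{\sqrt{1 - e^{-2t}}} \Big|,\ \Big| \int 1_{C^{-\epsilon}}\, d\big[ (Q_{(n)})_{e^{-t}} - \Phi_{e^{-t}} \big] * \Phi_{\sqrt{1 - e^{-2t}}} \Big| \Big\} \le \sup_{h \in \mathcal{H}} |E T_t \bar h(S_n)|. $$

Since $\mathcal{C}$ is invariant under translation, one then obtains

$$ \gamma^*(h : \epsilon) \le \sup_{h \in \mathcal{H}} |E T_t \bar h(S_n)|. \tag{4.6} $$

Also, letting $Z$ be standard Normal $N(0, I_k)$,

$$ \bar\omega_h(2\epsilon : \Phi_{e^{-t}}) = P\big( e^{-t} Z \in (\partial C)^{2\epsilon} \big) = P\big( Z \in e^{t} (\partial C)^{2\epsilon} \big) \le c_3 \sqrt{k}\, 2\epsilon\, e^{t}, \tag{4.7} $$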



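where $c_3 > 0$ is a constant (see Bhattacharya and Rao [5], Theorem 3.1). From (4.4) one gets

$$ P\big( |\sqrt{1 - e^{-2t}}\, Z| < \epsilon \big) = \frac{7}{8}, \qquad \text{i.e.,} \qquad P\Big( |Z| < \frac{\epsilon}{\sqrt{1 - e^{-2t}}} \Big) = \frac{7}{8}, $$

so that $\epsilon / \sqrt{1 - e^{-2t}} = a_k$, where $a_k$ satisfies $P(|Z| < a_k) = \frac{7}{8}$. It is simple to check that $a_k = O(\sqrt{k})$ as $k \to \infty$,
and

$$ a_k \le c_4 \sqrt{k}, \qquad \epsilon = a_k \sqrt{1 - e^{-2t}} \le c_4 \sqrt{k} \sqrt{1 - e^{-2t}} \le c_4 \sqrt{k} \sqrt{2t}. \tag{4.8} $$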
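One way to see the order $\sqrt{k}$ of the radius in (4.8) is Markov's inequality: $P(|Z|^2 \ge 8k) \le E|Z|^2/(8k) = 1/8$, so the radius with $P(|Z| < a_k) = 7/8$ is at most $\sqrt{8k}$. This route is our own illustration and may differ from the argument the authors had in mind. The sketch below checks it in dimensions 1 and 2, where the radius can be computed with the standard library alone; all function names are hypothetical.

```python
import math


def radius_dim1(p=7.0 / 8.0, tol=1e-12):
    """Solve P(|Z| < a) = erf(a / sqrt(2)) = p for k = 1 by bisection."""
    lo, hi = 0.0, 10.0
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if math.erf(mid / math.sqrt(2.0)) < p:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


def radius_dim2(p=7.0 / 8.0):
    """For k = 2, P(|Z| < a) = 1 - exp(-a^2 / 2), which inverts in closed form."""
    return math.sqrt(-2.0 * math.log(1.0 - p))


def markov_radius(k, p=7.0 / 8.0):
    """Upper bound sqrt(k / (1 - p)) from Markov's inequality applied to |Z|^2."""
    return math.sqrt(k / (1.0 - p))
```

With $p = 7/8$ the Markov bound is $\sqrt{8k}$, so any $c_4 \ge \sqrt{8}$ would do in this sketch; sharper constants are of course possible.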



Using this estimate of $\epsilon$ in (4.7), one obtains

$$ \omega_h^*(2\epsilon : \Phi_{e^{-t}}) \le c_5\, k \sqrt{t}\, e^{t}. \tag{4.9} $$

The smoothing inequality now yields (use (4.3), (4.6), (4.9) in (4.5))

$$ \delta_n \le \frac{4}{3} \Big[ \sup_{h \in \mathcal{H}} |E T_t \bar h(S_n)| + c_5\, k \sqrt{t}\, e^{t} \Big]. \tag{4.10} $$

Now use (3.16) in (4.10) to get

$$ \delta_n \le (c_6 k^{3/2} \rho_3) \frac{\delta_{n-1}}{\sqrt{nt}} + \frac{c_7 k^{3/2} \rho_3}{\sqrt{n}} + c_8\, k \sqrt{t}\, e^{t}. \tag{4.11} $$

By comparing the first and third terms on the right, an optimal order of $t$ is obtained as

$$ t = \min\Big\{ 1,\ \frac{\sqrt{k}\, \delta_{n-1}\, \rho_3}{\sqrt{n}} \Big\}. $$

It follows that

$$ \delta_n \le (c_9 k^{5/4} \rho_3^{1/2}) \frac{\delta_{n-1}^{1/2}}{n^{1/4}} + \frac{c_7 k^{3/2} \rho_3}{n^{1/2}}. \tag{4.12} $$
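The balancing step behind the choice of $t$ can be illustrated numerically. In the sketch below the constants `c6` and `c8` are placeholders set to 1 (the text only asserts that such absolute constants exist), and the function names are our own. At the balanced value of $t$, the first term of (4.11) and the third term without its factor $e^t$ coincide and are both of order $k^{5/4} \rho_3^{1/2} \delta_{n-1}^{1/2} / n^{1/4}$, which is the first term of (4.12).

```python
import math


def balanced_t(k, rho3, delta_prev, n):
    """t = min{1, sqrt(k) * delta_{n-1} * rho_3 / sqrt(n)}, as in the text."""
    return min(1.0, math.sqrt(k) * delta_prev * rho3 / math.sqrt(n))


def first_and_third_terms(k, rho3, delta_prev, n, t, c6=1.0, c8=1.0):
    """First and third terms on the right of (4.11); c6 and c8 are
    placeholder values, not the constants of the paper."""
    first = c6 * k ** 1.5 * rho3 * delta_prev / math.sqrt(n * t)
    third = c8 * k * math.sqrt(t) * math.exp(t)
    return first, third


def balanced_order(k, rho3, delta_prev, n):
    """The common order k^{5/4} rho_3^{1/2} delta_{n-1}^{1/2} / n^{1/4}."""
    return k ** 1.25 * math.sqrt(rho3 * delta_prev) / n ** 0.25
```

For example, with `k = 3`, `rho3 = 2 * 3 ** 1.5`, `delta_prev = 0.01` and `n = 10 ** 6`, the balanced `t` is well below 1 and both terms agree with `balanced_order` up to the factor $e^t$.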



Consider now the induction hypothesis: The inequality

$$ \delta_n \le \frac{c\, k^{5/2} \rho_3}{n^{1/2}} \tag{4.13} $$

holds for some $n \ge 1$ and an absolute constant $c \ge 1$ specified below. Note that (4.13) clearly holds for $n \le c^2 k^5 \rho_3^2$.
Since $c^2 k^5 \rho_3^2 \ge k^8$, suppose then that (4.13) holds for some $n = n_0 \ge k^8$. Then, under the
induction hypothesis, and (4.12),

$$ \delta_{n_0 + 1} \le \frac{c_9 \sqrt{c}\, k^{\frac{5}{4} + \frac{5}{4}} \rho_3}{(n_0 (n_0 + 1))^{1/4}} + \frac{c_7 k^{3/2} \rho_3}{(n_0 + 1)^{1/2}} \le \frac{c_{10} \sqrt{c}\, k^{5/2} \rho_3}{(n_0 + 1)^{1/2}} + \frac{c_7 k^{5/2} \rho_3\, k^{-1}}{(n_0 + 1)^{1/2}}, \tag{4.14} $$

where $c_{10}$ is an absolute constant multiple of $c_9$, since $(n_0 + 1)/n_0 \le 2$.



Now, choose $c$ to be the greater of 1 and the positive solution of $c = c_{10} \sqrt{c} + c_7$, to check that (4.13) holds for
$n = n_0 + 1$. Hence (4.13) holds for all $n$.
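The constant in the induction step can be made explicit: with $x = \sqrt{c}$ the equation $c = c_{10}\sqrt{c} + c_7$ is the quadratic $x^2 - c_{10} x - c_7 = 0$. The sketch below solves it; the function name and the sample values of the two constants are assumptions made purely for illustration.

```python
import math


def induction_constant(c10, c7):
    """Greater of 1 and the positive solution c of c = c10 * sqrt(c) + c7.

    With x = sqrt(c), the positive root of x^2 - c10 * x - c7 = 0 is
    x = (c10 + sqrt(c10^2 + 4 * c7)) / 2."""
    root = 0.5 * (c10 + math.sqrt(c10 * c10 + 4.0 * c7))
    return max(1.0, root * root)


def closes_induction(c, c10, c7):
    """Check the inequality c10 * sqrt(c) + c7 <= c needed for the induction step."""
    return c10 * math.sqrt(c) + c7 <= c * (1.0 + 1e-12)
```

For instance, the illustrative values `c10 = 2.0`, `c7 = 3.0` give $\sqrt{c} = 3$ and $c = 9$, and indeed $2 \cdot 3 + 3 = 9$.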
We have proved the following result. 

Theorem 1 There exists an absolute constant $c > 0$ such that

$$ \delta_n \le \frac{c\, k^{5/2} \rho_3}{n^{1/2}}. \tag{4.15} $$

5. The Non-Identically Distributed Case 

For the general case considered in [9], the $X_j$'s ($1 \le j \le n$) are independent with zero means and $\sum_{j=1}^{n} \mathrm{Cov}\, X_j = I_k$.
Assume

$$ \beta_3 \equiv \sum_{1 \le j \le n} E\|X_j\|^3 < \infty. \tag{5.1} $$
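Let $\{\tilde X_j : 1 \le j \le n\}$ be an independent copy of $\{X_j : 1 \le j \le n\}$. Then, writing $S_n = \sum_{j=1}^{n} X_j$, as before,

$$ E \sum_{i=1}^{k} D_{ii} \psi_t(S_n) = E \sum_{j=1}^{n} \sum_{i, i' = 1}^{k} D_{ii'} \psi_t(S_n)\, \tilde X_j^{(i)} \tilde X_j^{(i')} $$

$$ = E\Big[ \sum_{j=1}^{n} \sum_{i, i' = 1}^{k} D_{ii'} \psi_t(S_n - X_j)\, \tilde X_j^{(i)} \tilde X_j^{(i')} + \sum_{j=1}^{n} \sum_{i, i', i'' = 1}^{k} \tilde X_j^{(i)} \tilde X_j^{(i')} X_j^{(i'')} \int_0^1 D_{ii'i''} \psi_t(S_n - X_j + v X_j)\, dv \Big] \tag{5.2} $$

and

$$ E[S_n \cdot \nabla \psi_t(S_n)] = E \sum_{j=1}^{n} \Big[ X_j \cdot \nabla \psi_t(S_n - X_j) + \sum_{i, i' = 1}^{k} X_j^{(i)} X_j^{(i')} D_{ii'} \psi_t(S_n - X_j) + \sum_{i, i', i'' = 1}^{k} X_j^{(i)} X_j^{(i')} X_j^{(i'')} \int_0^1 (1 - v)\, D_{ii'i''} \psi_t(S_n - X_j + v X_j)\, dv \Big]. \tag{5.3} $$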



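Subtracting (5.3) from (5.2) and noting that

$$ E X_j \cdot \nabla \psi_t(S_n - X_j) = 0, $$

one obtains

$$ E T_t \bar h(S_n) = E\Big[ \sum_{j=1}^{n} \sum_{i, i', i'' = 1}^{k} \tilde X_j^{(i)} \tilde X_j^{(i')} X_j^{(i'')} \int_0^1 D_{ii'i''} \psi_t(S_n - X_j + v X_j)\, dv - \sum_{j=1}^{n} \sum_{i, i', i'' = 1}^{k} X_j^{(i)} X_j^{(i')} X_j^{(i'')} \int_0^1 (1 - v)\, D_{ii'i''} \psi_t(S_n - X_j + v X_j)\, dv \Big]. \tag{5.4} $$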



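The estimation of the conditional expectation of the integrals $\int_0^1$ in (5.4), given $X_j$, proceeds as in Section 3 (with
$X_j$ in place of $X_1$). The only significant change is in the normalization in the argument of $\bar h$ (see (3.8)-(3.11)) where,
writing $N_j$ as the positive square root of the inverse of $\mathrm{Cov}(S_n - X_j)$,

$$ E\big[ \bar h\big( e^{-s}(S_n - X_j) + e^{-s} v X_j + \sqrt{1 - e^{-2s}}\, z \big) \mid X_j \big] = E\big[ \bar h\big( e^{-s} N_j^{-1}(N_j(S_n - X_j)) + e^{-s} v X_j + \sqrt{1 - e^{-2s}}\, z \big) \mid X_j \big] $$

$$ = \int \bar h\big( e^{-s} N_j^{-1} x + e^{-s} v X_j + \sqrt{1 - e^{-2s}}\, z \big)\, dQ_{(n-1), j}(x) = \int \bar h\big( e^{-s} N_j^{-1}\big( x + N_j e^{s} \sqrt{1 - e^{-2s}}\, z \big) + e^{-s} v X_j \big)\, dQ_{(n-1), j}(x), \tag{5.5} $$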

where $Q_{(n)}$ denotes the distribution of $S_n = \sum_{j=1}^{n} X_j$, and $Q_{(n-1), j}$ that of $N_j(S_n - X_j)$, which has mean zero and
covariance $I_k$. As in Section 3, the last integration is divided into two parts: $d(Q_{(n-1), j} - \Phi)(x) + d\Phi(x)$. Since the
class of Borel convex sets is invariant under nonsingular affine linear transformations, the integral with respect to
$Q_{(n-1), j} - \Phi$ is bounded by $\delta_{n-1}$. For the integral with respect to $\Phi$, we change variables $x \to y = x + A_j z$, where
$A_j = e^{s} \sqrt{1 - e^{-2s}}\, N_j$. The estimation of the integral now proceeds as in (2.3)-(2.8), with the scalar $a$ replaced by the
matrix $A_j$. The effect of this is simply to change the sum $a^2 \sum_{r, r'} z_r z_{r'} D_{rr'} \phi(y - v a z)$ in (2.4) to

$$ \sum_{r, r' = 1}^{k} (A_j z)_r (A_j z)_{r'}\, D_{rr'} \phi(y - v A_j z). $$

Arguing as in (2.3)-(2.8) one arrives at the upper bound for (5.5) given by

$$ c' k \|A_j\|^2 = c' k\, e^{2s}(1 - e^{-2s}) \|N_j\|^2 \le c' k\, e^{2s}(1 - e^{-2s}) \big( 1 - \beta_3^{2/3} \big)^{-1}, $$
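using

$$ \|N_j\|^2 = \big\| (I_k - \mathrm{Cov}\, X_j)^{-1/2} \big\|^2 = \big\| (I_k - \mathrm{Cov}\, X_j)^{-1} \big\| = \Big( \inf_{|u| = 1} u \cdot (I_k - \mathrm{Cov}\, X_j) u \Big)^{-1}, $$

$$ \inf_{|u| = 1} u \cdot (I_k - \mathrm{Cov}\, X_j) u = \inf_{|u| = 1} \big( 1 - E(u \cdot X_j)^2 \big) \ge 1 - E|X_j|^2 \ge 1 - \big( E|X_j|^3 \big)^{2/3} \ge 1 - \beta_3^{2/3}, \tag{5.6} $$

and assuming

$$ \beta_3 < 1. \tag{5.7} $$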



Proceeding as in Section 4 one arrives at the bound:

$$ \delta_n \le c\, k^{5/2} \beta_3. \tag{5.8} $$

If one takes the absolute constant $c > 1$, then $\beta_3$ may be assumed to be smaller than or equal to $c^{-1} k^{-5/2}$, and
$(1 - \beta_3^{2/3})^{-1} \le (1 - c^{-2/3})^{-1} = c'$. The induction argument is similar.

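Remark: If one defines

$$ \gamma_3 = \sum_{j=1}^{n} E\Big( \sum_{i=1}^{k} |X_j^{(i)}| \Big)^3, \tag{5.9} $$

then

$$ \sum_{j=1}^{n} \sum_{i, i', i'' = 1}^{k} E\big| X_j^{(i)} X_j^{(i')} X_j^{(i'')} \big| = \gamma_3. $$

Since $\gamma_3$ now replaces $k^{3/2} \beta_3$ in the computations, it follows that

$$ \delta_n \le c\, k\, \gamma_3. \tag{5.10} $$

Since $\gamma_3 \le k^{3/2} \beta_3$, (5.10) provides a better bound than (5.8) or (4.13).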
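The comparison between the two third-moment quantities in the Remark rests on the pointwise inequality $(\sum_{i} |x_i|)^3 \le k^{3/2} \|x\|^3$, a consequence of the Cauchy-Schwarz inequality, with equality when all coordinates have the same absolute value. The sketch below checks this on seeded random vectors; the helper names are hypothetical and the check is only an illustration, not a proof.

```python
import math
import random


def l1_norm_cubed(x):
    """(sum_i |x_i|)^3, the integrand appearing in gamma_3."""
    return sum(abs(v) for v in x) ** 3


def l2_norm_cubed(x):
    """||x||^3, the integrand appearing in beta_3."""
    return math.sqrt(sum(v * v for v in x)) ** 3


def inequality_holds(k, trials=1000, seed=0):
    """Check (sum |x_i|)^3 <= k^{3/2} ||x||^3 on seeded Gaussian vectors."""
    rng = random.Random(seed)
    for _ in range(trials):
        x = [rng.gauss(0.0, 1.0) for _ in range(k)]
        if l1_norm_cubed(x) > k ** 1.5 * l2_norm_cubed(x) * (1.0 + 1e-12):
            return False
    return True
```

Taking expectations and summing over $j$ turns the pointwise inequality into the stated comparison of the two moment sums.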